WHAT IS CLAIMED IS:

1. A method for clustering a plurality of items, each of the items including
information, guided toward an initial organization structure, the method comprising: 

inputting a plurality of items, each of the items including information, into a 
clustering process;

inputting an initial organization structure into the clustering process, the initial 
organization structure including one or more categories, at least one of the categories being
associated with one of the items;

processing using at least processing hardware the plurality of items based upon at
least the initial organization structure and the information in each of the items in at least the 
clustering process; 

determining a resulting organization structure based upon the processing, the
resulting organization structure more closely resembling the initial organization structure than if 
an empty organization structure or an alternative initial organization structure had been input into 
the clustering process; and 

storing the resulting organization structure in memory.
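
One way to picture the method of claim 1 is a clustering process whose starting categories come from the initial organization structure. The sketch below is only an illustration under stated assumptions: it swaps in a seeded k-means procedure, which the claims do not name, and the function names (`seeded_cluster`, `mean`, `sq_dist`), the dictionary layout, and the use of numeric vectors as the "information" in each item are all hypothetical.

```python
def mean(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def sq_dist(u, v):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def seeded_cluster(items, initial_structure, iterations=10):
    """Cluster items, starting from the categories of an initial structure.

    items: dict mapping item id -> numeric vector (assumed representation).
    initial_structure: dict mapping category name -> non-empty list of
        item ids already associated with that category.
    Returns a dict mapping category name -> list of item ids.
    """
    # Seed one centroid per initial category from its associated items.
    centroids = {
        cat: mean([items[i] for i in ids])
        for cat, ids in initial_structure.items()
    }
    assignment = {}
    for _ in range(iterations):
        # Assign every item to the nearest category centroid.
        assignment = {
            item_id: min(centroids, key=lambda c: sq_dist(vec, centroids[c]))
            for item_id, vec in items.items()
        }
        # Recompute each centroid from its current members.
        for cat in centroids:
            members = [items[i] for i, a in assignment.items() if a == cat]
            if members:
                centroids[cat] = mean(members)
    # The resulting organization structure, kept in memory as a dict.
    result = {cat: [] for cat in centroids}
    for item_id, cat in assignment.items():
        result[cat].append(item_id)
    return result
```

Because the centroids start from the initial categories, the result tends to stay close to the initial structure, which is the property the claim recites. This sketch cannot add new categories, so it does not illustrate claim 4.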

2. The method of claim 1 wherein the processing comprises determining a 
likeness level between a first item and a second item, the likeness level between two items
increased if they are both similar to items in one or more of the categories of the initial
organization structure. 

3. The method of claim 2 wherein the determining the likeness level between
the first item and the second item comprising:

associating a first feature vector with the first item and a second feature vector
with the second item, each feature vector representing information associated with each item; 

adding a first additional feature and a second additional feature to the first feature 
vector and the second feature vector of the first item and the second item, respectively, the first 
additional feature representing a first category of the initial organization structure and the second 
additional feature representing a second category of the initial organization structure, the first
additional feature providing a degree to which the first item is similar to one or more items in the
first category of the initial organization structure; and

calculating a degree of similarity of the first item and the second item including
calculating a similarity measure between the first additional feature and the second additional
feature.
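
The likeness computation of claims 2 and 3 can be sketched roughly as follows. This is a minimal illustration, not the claimed implementation: it assumes cosine similarity as the similarity measure, assumes the "degree to which an item is similar to a category" is the item's highest cosine similarity to any member of that category, and simplifies by appending one additional feature per category to every item. The names `cosine`, `category_feature`, `augment`, `likeness`, and the `weight` parameter are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def category_feature(item_vec, category_items):
    """Assumed degree of similarity of an item to a category:
    its highest cosine similarity to any item already in the category."""
    return max((cosine(item_vec, m) for m in category_items), default=0.0)


def augment(item_vec, categories, weight=1.0):
    """Extend a feature vector with one additional feature per initial category."""
    extra = [weight * category_feature(item_vec, members) for members in categories]
    return list(item_vec) + extra


def likeness(first_vec, second_vec, categories, weight=1.0):
    """Similarity of two items computed over their extended feature vectors,
    so the additional category features contribute to the measure."""
    return cosine(
        augment(first_vec, categories, weight),
        augment(second_vec, categories, weight),
    )
```

With a single initial category containing the item `[1, 0, 0]`, the items `[1, 1, 0]` and `[1, 0, 1]` have a plain cosine similarity of 0.5, while their likeness over the extended vectors is higher because both resemble the same category. A larger `weight` would pull the result further toward the initial structure.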

4. The method of claim 1 wherein the resulting organization structure
includes a portion of the initial organization structure and at least one additional category
coupled to the initial organization structure.



5. The method of claim 1 wherein the resulting organization structure relates
to the initial organization structure based upon a category similarity.

6. The method of claim 1 wherein the resulting organization structure relates
to the initial organization structure based upon a similarity of hierarchy structure.

7. The method of claim 1 wherein the item is a document, a product, a
person, a DNA sequence, a purchase transaction, a financial record, or a species description.



8. The method of claim 1 further comprising outputting the resulting
organization structure on an output device.

9. The method of claim 1 wherein the processing hardware uses at least a
500 MHz clock to efficiently run the clustering process.

10. The method of claim 1 wherein the plurality of items includes at least
10,000 items.

11. A computer aided information organization device, the device including
one or more computer memories, the one or more computer memories including:

a first code directed to inputting at least 10,000 items in electronic form into a
clustering process, each of the items including information;

a second code directed to inputting an initial organization structure in electronic
form into the clustering process, the initial organization structure including one or more
categories, at least one of the categories being associated with one of the items;

a third code directed to processing using at least processing hardware the plurality
of items based upon at least the initial organization structure and the information in each of the
items in at least the clustering process;

a fourth code directed to determining a resulting organization structure based
upon the processing, the resulting organization structure more closely resembling the initial
organization structure than if an empty organization structure or an alternative initial
organization structure had been input into the clustering process; and

a fifth code directed to storing the resulting organization structure in the one or
more memories or another memory.

12. The device of claim 11 further comprising a sixth code directed to
determining a likeness level between a first item and a second item, the likeness level between
the first item and the second item increasing if they are both similar to items in one or more of the
categories in the initial organization structure.

13. The device of claim 12 wherein the sixth code directed to determining the
likeness level between the first item and the second item comprising:

a code directed to associating a first feature vector with the first item and a second 
feature vector with the second item, each feature vector relating to information associated with
the item; 

a code directed to extending the feature vector of each item with an additional
feature representing a category of the initial organization structure, the additional feature relating 
to a degree to which the item is similar to one or more items in the corresponding category of the
initial organization structure; and 

a code directed to calculating a measure of similarity of the first item with the 
second item including calculating the similarity measure between the first additional feature and
the second additional feature. 



14. The device of claim 11 wherein the resulting organization structure
includes a portion of the initial organization structure and at least one additional category
coupled to the initial organization structure.






15. The device of claim 11 further comprising a sixth code directed to
outputting the resulting organization structure, the resulting organization structure including a
plurality of categories.

16. The device of claim 15 further comprising a seventh code directed to
inputting additional items using the resulting organization structure.

17. The device of claim 11 further comprising a sixth code directed to
independently modifying the resulting organization structure using a graphical user interface.

18. The device of claim 17 wherein the independently modifying is provided
by a user coupled to the graphical user interface.